Computer and Modernization, 2010, Vol. 1, Issue (11): 9-11,1. doi: 10.3969/j.issn.1006-2475.2010.11.003

• Algorithm Design and Analysis •

Semantic-based Automatic Text Classification Method

HU Xiao-hui, XU Ye-ke, LIU Bin   

  1. Department of Information & Management Engineering, Jiangxi Vocational College of Mechanical & Electrical Technology, Nanchang 330013, China
  • Received: 2010-06-01 Revised: 1900-01-01 Online: 2010-11-25 Published: 2010-11-25

Abstract: Automatic text classification is the task of assigning pre-defined category labels to documents. The Vector Space Model (VSM) is limited in that it cannot effectively express the structure of documents. To address this problem, this paper constructs a similarity matrix from the training texts, obtains the subject information of each category from the similarity matrix, and then constructs a classifier from that subject information. Finally, this classifier is combined with a classic classifier to determine the category of a text. The experimental system shows the effectiveness of the method.
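
The pipeline summarized in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's formulas, so every concrete choice here is an assumption made for illustration: Jaccard co-occurrence as the term similarity, summed similarity as the measure of how central a term is to a category, a term-frequency centroid with cosine similarity standing in for the "classic classifier", and a linear weight `alpha` for combining the two scores. All function names are hypothetical.

```python
import math
from collections import Counter, defaultdict
from itertools import combinations


def term_similarity(docs):
    """Term-term similarity matrix (assumed: Jaccard overlap of the
    sets of documents in which each pair of terms occurs)."""
    postings = defaultdict(set)
    for i, doc in enumerate(docs):
        for term in set(doc):
            postings[term].add(i)
    sim = defaultdict(dict)
    for a, b in combinations(sorted(postings), 2):
        shared = len(postings[a] & postings[b])
        if shared:
            s = shared / len(postings[a] | postings[b])
            sim[a][b] = s
            sim[b][a] = s
    return sim


def subject_terms(docs, k=3):
    """Assumed notion of 'subject information': the k terms with the
    largest total similarity to other terms, weights summing to 1."""
    sim = term_similarity(docs)
    central = {t: sum(row.values()) for t, row in sim.items()}
    top = sorted(central, key=lambda t: (-central[t], t))[:k]
    total = sum(central[t] for t in top) or 1.0
    return {t: central[t] / total for t in top}


def cosine(u, v):
    """Cosine similarity between two sparse term-weight mappings."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm = math.sqrt(sum(x * x for x in u.values())) * \
        math.sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0


def train(labeled_docs, k=3):
    """Build, per category, the subject terms and a TF centroid."""
    by_cat = defaultdict(list)
    for tokens, cat in labeled_docs:
        by_cat[cat].append(tokens)
    model = {}
    for cat, docs in by_cat.items():
        centroid = Counter()
        for d in docs:
            centroid.update(d)
        model[cat] = (subject_terms(docs, k), centroid)
    return model


def classify(model, tokens, alpha=0.5):
    """Combine the subject-based score with the VSM centroid score."""
    tf = Counter(tokens)
    scores = {}
    for cat, (subjects, centroid) in model.items():
        subject_score = cosine(tf, subjects)
        vsm_score = cosine(tf, centroid)
        scores[cat] = alpha * subject_score + (1 - alpha) * vsm_score
    return max(scores, key=scores.get)


training = [
    ("stock market price rise".split(), "finance"),
    ("market price fall bank".split(), "finance"),
    ("bank stock price".split(), "finance"),
    ("team win match goal".split(), "sport"),
    ("match goal player team".split(), "sport"),
    ("player team win".split(), "sport"),
]
model = train(training)
print(classify(model, "bank price".split()))
```

In this sketch, `alpha=1.0` uses only the subject-based score and `alpha=0.0` only the centroid score; the paper's actual combination rule may differ.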

Key words: text classification, semantic features, VSM, graphical model, algorithm

CLC Number: